CLAIMS 



1 1 A method, comprising the steps of: 

2 decoding a macroinstruction of a computer, the decoding of the macroinstruction 

3 generating a number of iterations of a sequence of one or more microinstructions, including: 

4 a pattern of microinstructions implementing a basic operation, and 

5 a branch instruction predicted not taken. 

6 on detecting that an iteration completes operation of the macroinstruction, adding a 

7 marker indicating the end of the macroinstruction to a microinstruction in the pipeline 

8 downstream of the instruction decoder; 

9 after reaching a termination condition of the macroinstruction, partially executing an 

10 iteration beyond the termination, the partial execution committing at least one side-effect to 

1 1 an architecturally-visible resource of the computer, and raising an exception to transfer 

12 control to a second microinstruction stream; and 

13 in a second microinstruction stream, unwinding the side-effects committed by the 

14 post-termination iteration. 

1 2. A method comprising the steps of: 

2 decoding a macroinstruction of a computer, the decoding of the macroinstruction 

3 generating a number of iterations of: 

4 a pattern of microinstructions implementing a basic operation, and 

5 a branch instruction predicted not taken. 



3. The method of claim 2, 

on detecting that an iteration completes operation of the macroinstruction, adding a 
marker indicating the end of the macroinstruction to a microinstruction in the pipeline 
downstream of the instruction decoder. 
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4. The method of claim 2, further comprising the steps of: 

partially executing a loop iteration beyond the termination condition of a loop of a 
decoded microinstruction stream, the partial execution committing at least one side-effect to 
an architecturally-visible resource of the computer; 

raising an exception to transfer control to a second microinstruction stream; 

in the second microinstruction stream, unwinding the side-effects committed by the 
post-termination iteration. 

5. The method of claim 2, wherein the microinstruction set is architecturally 
exposed to programs fetched from a memory of the computer. 

6. The method of claim 5, wherein instructions of the microinstruction set are 
managed by a memory management unit between a main memory of the computer and one or 
more cache levels. 

7. A computer, comprising: 

an instruction decoder designed to decode macroinstructions into microinstructions 
for execution in an instruction pipeline on a computer, and for at least one macroinstruction, 
the decoding of the macroinstruction generating a number of iterations of: 

a pattern of microinstructions implementing a basic operation, and 

a branch microinstruction predicted not taken. 

8. The computer of claim 7, further comprising: 

instruction fetch and execution circuitry designed to fetch and execute instructions in 
both the macroinstruction set and the microinstruction set, each of the two instruction sets 
including store instructions to write data to a memory of the computer; 

store monitoring circuitry designed to monitor the store instructions and to invalidate 
any copies of a datum in memory overwritten by the store instructions, including copies of 
instructions in any instruction cache in the instruction set other than the instruction set of the 
current store instruction. 
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9. The computer of claim 7, further comprising: 

execution circuitry designed to execute an instruction calling for waiting to allow a 
pipeline to drain and to set bits of a floating-point control word to values denoted in an 
explicit immediate field of the instruction. 

1 0. The computer of claim 7, the instruction decoder being further designed to 
cease generating iterations when a termination condition of the macroinstruction is detected 
in the instruction pipeline. 

1 1 . The computer of claim 1 0, wherein the detection of termination comprises 
detecting a branch mispredict. 

12. The computer of claim 7, wherein the branch microinstruction is a branch 
instruction available to a program fetched from a memory of the computer. 

13. The computer of claim 7, wherein the branch microinstruction is generated 
carrying a marker indicating that the branch microinstruction defines a boundary between 
two successive iterations. 

14. A method comprising the steps of: 

decoding a macroinstruction on a computer, the macroinstruction calling for a number 
of iterations of a sequence of one or more microinstructions; and 

on detecting that an iteration completes operation of the macroinstruction, adding a 
marker indicating the end of the macroinstruction to a microinstruction in the pipeline 
downstream of the instruction decoder. 

15. The method of claim 14, further comprising the steps of: 

after reaching a termination condition of the iterations, partially executing an iteration 
beyond the termination, the partial execution committing at least one side-effect to an 
architecturally-visible resource of the computer, and raising an exception to transfer control 
to a second microinstruction stream; 
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in the second microinstruction stream, unwinding the side-effects committed by the 
post-termination iteration. 

16. The method of claim 14, further comprising the steps of: 

executing a microinstruction on the computer, the microinstruction storing into a 
memory location a value of a second instruction coded in the macroinstruction set; 

in response to the storing, clearing a memory system, including an instruction cache, 
and execution pipeline of the computer of the former content of the memory location; 

executing the second instruction in the execution pipeline. 

17. The method of claim 14, wherein the detection of termination comprises 
detecting a branch mispredict. 

18. The method of claim 1 7, wherein the mispredict is detected on a branch 
microinstruction architecturally available to a program fetched from a memory of the 
method. 

19. The method of claim 14, wherein the branch microinstruction is generated 
carrying a marker indicating that the branch microinstruction defines a boundary between 
two successive iterations. 

20. A computer, comprising: 

an instruction decoder designed to decode a macroinstruction set that includes a 
macroinstruction calling for a number of iterations of a sequence of one or more 
microinstructions; 

a pipeline stage downstream of the instruction decoder designed to detect that 
operation of the macroinstruction is complete, and in response, to add to a microinstruction a 
marker indicating the end of the macroinstruction. 
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2 1 . The computer of claim 20 : 

wherein the decoder, when decoding the macroinstruction, is designed to generate a 
number of iterations of: 

a pattern of microinstructions implementing a basic operation, and 
a branch microinstruction predicted not taken. 

22. The computer of claim 20, 

circuitry designed to execute an instruction calling for waiting to allow a pipeline to 
drain and to set bits of a floating-point control word to values denoted in an explicit 
immediate field of the instruction. 

23. A method, comprising the steps of: 

after reaching a termination condition of a loop of a first microinstruction stream 
executing in a computer, the microinstruction stream being generated by decoding a 
macroinstruction, partially executing a loop iteration beyond the termination, the partial 
execution committing at least one side-effect to an architecturally-visible resource of the 
computer, and raising an exception to transfer control to a second microinstruction stream; 

in the second microinstruction stream, unwinding the side-effects committed by the 
post-termination iteration. 

24. The method of claim 23, 

executing a microinstruction of a computer, the microinstruction storing into a 
memory location a value of a second instruction coded in a second instruction set; 

in response to the storing, clearing an instruction cache and execution pipeline of the 
computer of the former content of the memory location; 

executing the second instruction in the execution pipeline. 

25. The method of claim 23, further comprising the step of: 

executing an instruction that calls for waiting to allow a pipeline to drain and setting 
bits of a floating-point control word to values denoted in an explicit immediate field of the 
instruction. 
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26. The method of claim 23, further comprising the step of: 

ceasing to generate iterations when a termination condition of the macroinstruction is 
detected in an execution stage of an instruction pipeline of the computer. 

27. The method of claim 26, wherein the detection of termination comprises 
detecting a branch mispredict. 

28. The method of claim 23, wherein each iteration completes with a branch 
microinstruction architecturally available to a program fetched from a memory of the 
computer. 

29. The method of claim 28, wherein the branch microinstruction is generated 
carrying a marker indicating that the branch microinstruction defines a boundary between 
two successive iterations. 

30. A computer, compri sing : 

circuitry designed to partially execute a post-termination iteration of a loop of a first 
microinstruction stream executing in the computer, the partial execution committing at least 
one side-effect to an architecturally-visible resource of the computer, and to raise an 
exception to transfer control to a second microinstruction stream; 

software of the second microinstruction stream, programmed to unwind side-effects 
committed by the post-termination iteration. 

3 1 . The computer of claim 30, 

an instruction decoder designed to decode macroinstructions into microinstructions 
for execution in an instruction pipeline on a computer, and for at least one macroinstruction, 
the decoding of the macroinstruction generating a number of iterations of: 

a pattern of microinstructions implementing a basic operation, and 

a branch microinstruction predicted not taken. 
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32. The computer of claim 30, further comprising: 

a pipeline stage downstream of an issue buffer of the computer, designed to detect the 
termination condition of the loop, and in response, to add to a microinstruction a marker 
indicating the end of the macroinstruction. 

33. The computer of claim 30, wherein the loop of the first microinstruction 
stream consists essentially of instructions generated by decoding a single macroinstruction 
fetched from memory of the computer. 

34. The computer of claim 30, wherein an instruction generator of the computer 
joins iterations of the loop by branches predicted not taken. 

35. A method comprising the steps of: 

in a computer having instruction fetch circuitry for fetching instructions in first and 
second instruction sets from a memory of the computer and executing the instructions, 
executing a first instruction coded in the first instruction set, the first instruction storing into a 
memory location a value of a second instruction coded in the second instruction set, 

in response to the storing, clearing an instruction cache and execution pipeline of the 
computer of the former content of the memory location; 

executing the second instruction in the execution pipeline. 

36. The method of claim 35, 

decoding an instruction of the second instruction set, the decoding generating a 
number of iterations of: 

a pattern of instructions of the first instruction set implementing a basic 

operation, and 

a branch instruction of the first instruction predicted not taken. 
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37. The method of claim 35, further comprising the step of: 

executing an instruction calling for waiting to allow a floating-point pipeline of the 
computer to drain and to set bits of a floating-point control word to values denoted in an 
explicit immediate field of the instruction. 

38. The method of claim 35, an instruction decoder for the second instruction set 
designed to generate instructions in the first instruction set for execution in the execution 
pipeline. 

39. The method of claim 35, wherein the instructions in the execution pipeline are 
not tagged with an indication of an instruction set of origin. 

40. The method of claim 35, wherein the monitoring is based on comparing 
addresses in a physical address space. 

41. A computer, comprising: 

instruction fetch and execution circuitry designed to fetch and execute instructions in 
two different instruction sets, each instruction set including store instructions to write data to 
a memory of the computer; 

store monitoring circuitry designed to monitor the store instructions and to invalidate 
any copies of a datum overwritten by the store instructions, including copies of instructions 
in any instruction cache, in the instruction set other than the instruction set of the current 
store instruction. 

42. The computer of claim 4 1 , 

an instruction decoder designed to decode an instruction in a first of the instruction 
sets, and in response, to generate a number of iterations of a sequence of one or more 
instructions in a second one of the instruction sets; 

a pipeline stage downstream of the instruction decoder designed to detect that 
operation of the decoded instruction is complete, and in response, to add to a one of the 
generated instructions a marker indicating the end of the macroinstruction. 
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43 . The computer of claim 4 1 : 

wherein the instruction fetch and execute circuitry is further designed to partially 
execute a post-termination iteration of a loop within a first instruction in a first one of the 
instruction sets, the partial execution committing at least one side-effect to an architecturally- 
visible resource of the computer, and to raise an exception to transfer control to a second 
instruction stream in a second one of the instruction sets, programmed to unwind side-effects 
committed by the post-termination iteration. 

44. A method, comprising the steps of: 

decoding and executing an instruction on a computer, execution of the instruction 
comprising the steps of waiting to allow a pipeline to drain, and setting bits of a floating- 
point control word to values denoted in an explicit immediate field of the instruction. 

45. The method of claim 44, wherein instruction fetch and execution circuitry of 
the computer are designed to fetch and execute a macroinstruction set and a microinstruction 
set from memory. 

46. The method of claim 45, 

decoding a macroinstruction of the computer, the decoding of the macroinstruction 
generating a number of iterations of: 

a pattern of microinstructions implementing a basic operation, and 
a branch instruction predicted not taken. 

47. The method of claim 45, further comprising the steps of: 

decoding a macroinstruction on the computer, the macroinstruction calling for a 
number of iterations of a sequence of one or more microinstructions; and 

on detecting that an iteration completes operation of the macroinstruction, adding a 
marker indicating the end of the macroinstruction to a microinstruction in the pipeline 
downstream of the instruction decoder. 
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48. The method of claim 45, further comprising the step of: 

emitting the instruction as a microinstruction in response to a macroinstruction whose 
execution is dependent on a full/empty state of a floating-point top-of-stack. 

49. A computer, comprising: 

execution circuitry designed to execute an instruction calling for waiting to allow a 
pipeline to drain and to set bits of a floating-point control word to values denoted in an 
explicit immediate field of the instruction. 

50. The computer of claim 49, wherein instruction fetch and execution circuitry of 
the computer are designed to fetch and execute instructions in two different instruction sets. 

5 1 . The computer of claim 49, further comprising: 

each instruction set including store instructions to write data to a memory of the 
computer 

store monitoring circuitry designed to monitor the store instructions and to invalidate 
any copies of a datum in memory overwritten by the store instructions, including copies of 
instructions in the instruction set other than the instruction set of the current store instruction. 

52. The computer of claim 49, the instruction specifying individual bits of the 
floating-point control word to be written, in addition to values to be written to those bits. 

53. The computer of claim 49, 

the execution circuitry being further designed to execute an instruction calling for 
waiting to allow a pipeline to drain and to raise an exception based on a test of bits of a 
floating-point control word. 
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